Contact tracing and epidemics control in social networks 

Ramon Huerta 1 ' 2 , Lev S. Tsimring 1 

(1) Institute for Nonlinear Science, University of California, San Diego, La Jolla, CA 92093-0402 
(2) GNB, E.T.S. de Ingenieria Informatica, Universidad Autonoma de Madrid, 28049 Madrid (SPAIN). 

(February 8, 2008) 

A generalization of the standard susceptible-infectious-removed (SIR) stochastic model for epidemics 
in sparse random networks is introduced which incorporates contact tracing in addition to random 
screening. We propose a deterministic mean-field description which yields quantitative agreement 
with stochastic simulations on random graphs. We also analyze the role of contact tracing in 
epidemics control in small-world networks and show that its effectiveness grows as the rewiring 
probability is reduced. 
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Properties of complex networks recently attracted 
much attention in physical community Q . Although per- 
haps it was prompted by the advent of the Internet and 
World-Wide Web, the importance of this subject goes 
far beyond computer networks. Indeed, daily commute, 
power and goods traffic, wired and wireless communica- 
tion, disease spreading occur within certain physical or 
social networks. The theory of disease spreading, which 
is known as mathematical epidemiology, has a long and 
rich history (see, e.g. Q). However, until recently the epi- 
demiological studies have been mostly concerned with so- 
called mean- field description of epidemics, in which it is 
assumed that at any time the probability to get infected 
is the same for all individuals. In some other works, 
spreading of a disease on relatively simple lattices of in- 
dividuals has been studied within so called "forest-fire" 
models ||. Only recently, the studies which elucidate 
the role of underlying network structure in the disease 
spreading began to appear in the literature Q [?|] . 

Most of the epidemiological models are based on sev- 
eral simple assumptions regarding disease contracting 
and cure. In particular, the most common mechanism 
of infection is through a contact with another infec- 
tious individual, and the mechanism of recovery is ei- 
ther deterministic or purely stochastic with a certain 
typical time of recovery. In the simplest Susceptiblc- 
Infectious-Susceptible (SIS) model, a recovered individ- 
ual immediately becomes susceptible again, while in a 
more complicated Susceptible-Infectious-Removed (SIR) 
model, cured individuals become immune and effectively 
excluded from further dynamics. 

While these models give a good description of evolution 
of many common infectious diseases, they usually neglect 
the role of intelligent strategies to stop nascent epidemics. 
Few epidemiological models take into account prevention 
strategies such as, for example, mass and ring vaccina- 
tion In practice, one of the main counter-epidemics 
measures is the contact tracing, when individuals which 
have been in contact with an infected (and identified) in- 
dividual, are found and thoroughly checked. It applies, 



among others, to the treatment of sexually transmitted 
diseases, tactics of law-enforcement organizations trying 
to uncover criminal or terrorists networks, cleaning of 
computer virus infection, etc. We only are aware of one 
theoretical paper ^ where a model of this kind has been 
studied. The model |9| is based on the assumption that 
infection is a slow branching process, while the contract 
tracing occurs at a much shorter time scale. This leads to 
a familiar SIR-type model with rescaled parameters and 
similar dynamics. In this Letter we consider a more real- 
istic model in which infection and contact tracing occur 
concurrently, and their interplay determines the dynam- 
ics of the system. 

Stochastic model. We assume that the population con- 
sists of N hosts whose connections to each other form 
a fixed graph. The hosts are enumerated with index 
n = 1, N. A node n is said to have a degree k(n) if it 
is connected to k other hosts. In case of random graphs 
the degree distribution is Poissonian with a certain mean 
degree K = (k(n)). 

For simplicity we assume that there in no spontaneous 
recovery, an infectious individual can only be disinfected 
externally through screening. Immediately upon disin- 
fecting, the individual becomes traced (T) for a certain 
period of time during which its neighbors are checked 
for possible infection. After that time, the individual 
spontaneously becomes removed, and its neighbors are 
no longer traced. 

Infection S — ► I. Initially, the whole population ex- 
cept for one host is assumed to be susceptible to infec- 
tion. The probability of host infection depends on the 
state of its nearest neighbors. The infection dynamics 
is modeled as a simple contact process: if a susceptible 
node n has fc,-(n) infectious neighbors, the probability 
that it becomes infectious during a small At time inter- 
val is a ki(n)At. 

Tracing J — ► T. The process of infection elimina- 
tion consists in finding infectious hosts and then curing 
them. Hosts are being checked with certain probability 
(3 which depends on the state of its neighbors. We postu- 
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late that if an infectious host is checked, it is immediately 
cured, eliminated or at least isolated so it cannot infect 
other hosts. We introduce two non-exclusive strategies of 
checking for infectious hosts: random checking and con- 
tact tracing. Random checking means choosing an arbi- 
trary host with probability ftAi, while contact tracing 
of host n is done with probability /3tkt(n)At where fc t (n) 
is the number of neighbors of n which are in the traced 
state T. The random checking process is equivalent to 
the removal process of general epidemics . 

Removal T — > R. With certain probability 7 At, 
traced hosts are transformed into the removed state, in 
which they also cannot be infected, but they are no longer 
under observation, so they do not initiate contact tracing. 

Stochastic simulations of the described process were 
performed using an event-driven scheme fllQj . It is sig- 
nificantly superior over synchronous and asynchronous 
update schemes both in terms of accuracy and compu- 
tational speed. In the event-driven scheme we select the 
time lapsed between two consecutive events from a Pois- 
son distribution with a combined probability of all events 
(infection, tracing, recovery, etc.), then choose a node 
(each node has its own probability to be chosen depend- 
ing on its state and the states of its neighbors), and apply 
the transition from one state to another according to the 
ratio of individual transition probabilities. 

In our simulations with random graph based networks, 
we typically built networks with average degree K = 10 
and 1000 nodes. For every random graph we ran 100 sim- 
ulations starting every time from a single (but different 
in each run) infected host. Then we averaged the results 
for 50 random graphs. The time bin was At = 10~ 6 for 
most simulations. In all simulations we varied the trac- 
ing parameters ft- and ft, while the infection constant 
was set at a — 0.1, and the transition rate from T to R, 
7 = 0.5. The latter parameter is important for the effec- 
tiveness of the targeting elimination, because the longer 
a node remains in the traced state, the more probable 
it is to trace its neighboring infectious nodes. However, 
since tracing presumably bears a significant cost, an op- 
timal choice of the tracing parameters (ft, (3t, and 7) for 
a given epidemics is an important issue. 

In Fig. ^ we present the "prevalence" of epidemics 
(the fraction of infectious nodes in the whole population 
i = I/N) as a function of time for several values of ft and 
ft. When ft = ft = 0.0 we have a simple SI process, 
and all the nodes eventually get infected (thick solid line 
in Fig. Q). Other curves show the fraction of infectious 
nodes as a function of time for ft = 0.02 and different 
values of ft. The ratio a//3 r is chosen to be above the epi- 
demics threshold J2|. For ft = we obtain the classical 
SIR process with randomly removed infectives (dashed 
line in Fig. ^). The epidemic eventually saturates, and 
the fraction of infectious nodes decays exponentially. The 
lower lines display the evolution of the infection fraction 
for values of ft ranging from to 2.5 with a step value 



of 0.1. The initial (exponential) phase of the epidemics 
growth is nearly independent of ft, because the contact 
tracing process is intrinsically nonlinear (it requires the 
presence of I-T connected pairs and therefore only be- 
gins after the first infected node is randomly screened). 
As expected, the tracing process significantly reduces the 
magnitude of the epidemics (maximal value of i), but at 
large times the infection decays with the same exponen- 
tial rate as for ft = 0.0 (again, because we return to the 
linear regime at small i). The most interesting feature of 
the process at large ft > 0.35 is the presence of a second 
maximum of i which indicates a second epidemic. Due 
to this second epidemic, the percentage of the infectious 
population at large times t > 40 may actually increase 
with increase of ft. It means that the range of ft values 
from 0.4 to 0.9 are not better to control the epidemics 
than values smaller than 0.4. 
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FIG. 1. Infected population in a random graph of 1000 
nodes and K — 10 for a — 0.1,7 = 0.5. The solid thick 
line corresponds to ft = 0, ft = 0, and the dashed line 
to ft = 0.02, ft = 0. Thin lines are for ft = 0.02 and 
ft = 0.1,0.2, ...2.5. 

Mean-field equations. At the first sight, it seems that 
the mean-field approach cannot be applied to the con- 
tact tracing, since it does not take into account the 
non-uniform distribution of infection in the population. 
Nevertheless, a more sophisticated mean-field approach 
which operates not only with the mean densities of states, 
but also with the densities of links connecting nodes with 
different states, can be devised. In the derivation we fol- 
low the method of Rand |J (see also (?]]). Let us intro- 
duce the number of nodes A, the number of connected 
pairs [AB] and triples L4J3C] of nodes, where A, B and 
C stand for any of the types S, I, T, R. For example, 
the number of connected pairs of infectious and traced 
nodes is denoted [IT]. Note that [AB] = [BA] and 
each pair in [AA] is counted twice. For large N, the ra- 
tios A/N, [AB]/N, [ABC]/N approach deterministic lim- 
its which we label a, [ab], [abc], respectively. 

The dynamics of the model is described by the follow- 
ing set of rate equations 
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s = —a[si], i = a[si] — [3 r i — Pt[iT], (1) 

[ss] = -2a[ssi], (2) 

[si] = a([ssi] — [isi] — [si]) — P r [si] — /3t[sir], (3) 

[ii] = 2a([isi] + [si]) - 2/3 r [ii] - 2&[mt], (4) 



[it] = p r [ii] + t ([iiT] - [rir] - [it]) - 7 [fr] - &[ir]. (5) 

Here we used the notation r = [T]/N to avoid confusion 
between the density of traced nodes and time t. Note that 
we omit here the equations for t, [tt] , [sr] , as well as any 
combinations involving the removed state, since they do 
not affect the dynamics of the infectious population. 

The meaning of these equations is rather straightfor- 
ward. For example, the terms in the r.h.s. of the last 
equation can be explained as follows. A (p, q) pair be- 
comes [it] through random screening of the infectious 
node q in an [ii] pair, or through contact tracing of node 
q from node r in a (p, q, r) triple in the state [Ut] . On the 
other hand, we can lose an [it] pair by contact tracing of 
the p node in a [tit] triple (r, p, g), by direct tracing p by 
q, by removing of q, or by random screening of p. Other 
equations can be obtained from similar arguments. 

This set of equation is not closed, as the equations for 
the pair densities contain triple densities. We need to 
introduce a closure rule. Similarly to Refs. (|J^], we can 
use the approximation [abc] = [ab][bc]/b, which follows 
from the condition that the influence of a node on the 
state of its second neighbor in a triple is neglig ible Q. 

Using this closure rule, we arrive at the following set 



of equations 

s = —asi, (6) 
i = asi — (3 r i — PtiT, (7) 

i = (uKs — a — [3 r )i — PtiT, (8) 
s * 

f = (fop - (3 t - 7 - a-i)f + /3 r p, (9) 
i 

p = a~(2i + 2- p)i - (p r + p t f)p. (10) 



where i = [is]/ 's is the mean number of infectious neigh- 
bors per susceptible node, f = [«r]/i is the mean number 
of traced neighbors per infectious node, and p — [ii]/i is 
the mean number of infectious neighbors of an infectious 
node. Notice that the equation for [ss] dropped out as 
[ss] = Ks 2 at all times. We used the initial conditions 
a(0) = l-»o,i(0) = io,»(0) = (if-l)i 0) f(0) = 0,p(0) = 
which correspond to a small set of disconnected infec- 
tious nodes. 

During the early stage of an epidemic the contact trac- 
ing can be neglected (r = 0), and Eqs. @-© arc re- 
duced to a set of three equations for s, i, i which coincide 
with the model has been studied in . Independently 
of [3 t , the initial epidemics growth is characterized by 
the basic reproduction number Ka/(a + /3 r ). However, 
as the number of traced individuals grows, the growth 
rate is reduced and the epidemics is saturated. In Fig.0 



the dynamics of the epidemics calculated from Eqs.(|6|)- 
) are shown for different values of /3 t . As seen from 
the figure, the maximum number of infectious nodes is 
drastically reduced with increase of (3t ■ In the same fig- 
ure we show the results of direct stochastic simulations 
for fit — 0, 0.5 [[Til. The most important question is 
whether the contact tracing is capable of arresting the 
exponential growth of the epidemics before it engulfs a 
finite portion of the total population. 




FIG. 2. Evolution of the infection prevalence for different 
/3 t , mean-field model (lines) and stochastic simulations (sym- 
bols), a = 0.1, K = 10, p T = 0.02, 7 = 0.5. 

To answer this question, we consider the limit of small 
epidemics in a large populations (i,i << 1), then we 
can set s(t) = 1 and drop Eq. ([?]). We also observe that 
i/i = K — 1 and drop Eq.(Q) as well. A simple calcu- 
lation shows that the critical value of Pt at which the 
exponential growth of infection is arrested, 

^ = (a(K-l) +1 )(a(K-l)-p r ) ^ 

Pt 

For P > p cri epidemic remains small at all times, and so 
there is no major outbreak of the epidemic, the maximum 
number of infectious nodes is independent of the popu- 
lation size, and depends on the initial size of infection 
if). For the parameter values of our stochastic simula- 
tions, a = 0.1, K = 10, P r = 0.02,7 = 0.5, we obtain 
P cr = 61.6. In Fig.^J we show the maximum number of 
infectious nodes during the epidemic as a function of p t 
at different initial epidemic sizes «o- in agreement with 
the above argument, for p t < P cr the value of i m ax is 
almost independent on io until i ma x ~ 2«o at a certain 
/3 t *, after which it remains nearly constant. The value of 
PI approaches p cr for i$ — > 0. 

It is easy to see that the epidemic threshold depends 
only on the average node degree and not on the specific 
topology of the underlying graph. However, the subse- 
quent development of the epidemics should be signifi- 
cantly affected by the network structure, in particular by 
the average minimum path length and the clustering co- 
efficient pi] . Low average minimum path means that any 
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node in the graph can be exposed to infection in a short 
time after an epidemics begins. If the clustering coef- 
ficient is large, the infection propagates faster within a 
certain community but then it make it easier to trace the 
epidemics. Sparse random networks studied above rep- 
resent a particular class of networks with short average 
minimal path and small clustering coefficient. Many so- 
cial networks are characterized by a relatively large clus- 
tering coefficient while keeping the average minimal path 
low. We studied numerically the effect of the network 
structure on the contact tracing of epidemics within the 
small- world model |i~3| ] . Changing the re- wiring probabil- 
ity p allows us to scan the range of networks from regular 
(p = 0) to random (p — ► 1) through the small- world range 




FIG. 3. Maximum number of infectious nodes vs. j3t for 
different ig. Parameters are the same as in Fig.^ 

(0.001 < p < 0.1) which exhibits a short average min- 
imal path and a large clustering coefficient typical for 
many social networks. We used the same number of 
nodes and edges as for the random graph simulations, 
and fixed the parameter values at a = 0.1, (3 r = 0.02, 
and 7 = 0.5. Fig.[| shows the dependence of the max- 
imum prevalence i max on p for several different /3 t . As 
we can see, i m ax changes mostly within the SW range 
(0.001 < p < 0.1) where the clustering coefficient and 
the average path undergo large variations. 




FIG. 4. The maximum prevalence as a function of the 
rewiring probability p. Effectiveness of the contact tracing 
becomes very significant in the small-world regime. 

In this Letter we studied the role of contact tracing as 
a part of the epidemics control strategy in complex net- 
works. We demonstrated that by applying this strategy, 
a major outbreak can be significantly reduced or even 
eliminated at a small additional cost. Based on the pair 
correlation approach due to Rand we developed the 
mean-field model of contact tracing for the case of ran- 
dom graphs. We also studied the influence of network 
topology on the contact tracing using the small-world 
model with variable re-wiring probability p, and found 
that its effectiveness grows as the reviring probability is 
reduced. The main change occurs within the small-world 
regime at p ~ 10~ 2 . 
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